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Abstract: 

Model complexity in amplitude analyses is often a priori under-constrained since the underlying 
theory permits a large number of possible amplitudes to contribute to most physical processes. The 
use of an overly complex model results in reduced predictive power and worse resolution on un¬ 
known parameters of interest. Therefore, it is common to reduce the complexity by removing from 
consideration some subset of the allowed amplitudes. This paper studies a method for limiting 
model complexity from the data sample itself through regularization during regression in the con¬ 
text of a multivariate (Dalitz-plot) analysis. The regularization technique applied greatly improves 
the performance. An outline of how to obtain the significance of a resonance in a multivariate 
amplitude analysis is also provided. 
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1. Introduction 

Model complexity in amplitude analyses is often a priori under-constrained since the underlying 
theory permits a large number of possible amplitudes to contribute to most physical processes. 
The use of an overly complex model results in reduced predictive power and worse resolution on 
unknown parameters of interest, e.g., resonance parameters or the contribution from a particular 
amplitude. Therefore, it is common to reduce the complexity by removing from consideration 
some subset of the allowed amplitudes. For example, often times only the lowest few possible 
values of orbital angular momenta are considered. In some cases ad hoc procedures are applied in 
an attempt to make this reduction in complexity unbiased. Such procedures include starting from 
a simple model and considering one additional amplitude at a time and labeling the amplitude as 
"unimportant" if the or likelihood does not change by a pre-defined "large enough" amount. This 
stepwise model reduction is well known to not be optimal []TI] . Furthermore, it is not well suited for 
amplitude analysis due to the fact that the most important contribution from an amplitude may be 
due to its interference with another amplitude. The stepwise procedure risks never considering two 
interfering amplitudes in the same model. This paper considers alternatives for selecting from data 
a set of amplitudes to use in the fit model in the context of a Dalitz-plot analysis. 

2. Example analysis 

The model we consider in this paper is production of an abc final sfafe, where a, b and c are 
pseudoscalars {i.e., fhey have spin-parify 0^) wifh masses = ni}, = nic = 0.1 in arbifrary unifs. 
This final sfafe will firsf be sfudied in bins of abc invarianf mass, mabc, wifh fhe fifs performed in 
each bin being independenf. The particles fhaf decay info fhe abc final sfafe are generically labeled 
as X, e.g., fhe reacfion mighf be yp —>• pX,X —)■ abc. The decay amplifudes are consfrucfed in fhe 
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Table 1. The set of resonant terms and properties used in the true p.d.f. of Model I for the Dalitz-plot 
example. The symbols are: 7^, mx, M the spin-parity, mass, width and spin projection of X; j is the 
isobar spin; and / is the orbital angular momentum between the isobar and the bachelor particle. 


isobar formalism with the observed intensity written as the coherent sum of resonant terms plus a 
non-resonant term added incoherently as follows: 


M 


aue 


+ Wn 


(2.1) 


where x = {m^b’^ac) represents the position in the Dalitz plot, ake"^'‘ describes the unknown com¬ 
plex factor for each resonant component of the model, denoted by the index k, and M denotes the 
spin substates of the various X particles ^ Table lists the properties and decay channels of the res¬ 
onances contained in the true probability density function (p.d.f.) for the primary model considered 
in this paper, referred to as Model I. The resonant terms contain contributions from Blatt-Weisskopf 
barrier factors 0], relativistic Breit-Wigner line shapes to describe the propagators, and spin fac¬ 
tors obtained using the Zemach formalism [0. The amplitudes are evaluated using the qft-t-i- 
package 0]. 

The key to this type of analysis is the fact that the Dalitz-plot distributions can be used to 
separate out the various partial-wave contributions. This can be done in each triabc bin without 
the need to assume some resonant-like behavior. A second step in such an analysis is to fit the 
magnitudes and phase differences observed niatc to determine if resonances do contribute and, 
if so, extract values for their pole parameters. 

Figure |I] a) shows the invariant mass distribution of the abc system for a simulated data set 
containing ~ 4.4 x 10^ events. There is clear structure from the resonant amplitudes in the model. 
The Dalitz plot distributions for a few niabc bins of Fig. |I] a) are shown in panels b) - f); these 
demonstrate the variation of the resonant structure with rriabc in the generated model. The goal 
of the amplitude analysis in this example is to determine the complex parameters, from an 

extended maximum likelihood fit to the data performed in 0.025-wide bins of mate- This so-called 
mass-independent fit is typical in hadron spectroscopy, where it is desirable to avoid parameteri¬ 
zation of the produced X resonances. The mass dependence of the ake“^'‘ factors is then studied 
separately to search for new resonances or to measure properties of known ones. An ensemble of 
simulated data sets that contain 500 — 25 000 events per Mate bin is used in this study, where the 
typical number of events in a given niabc bin is shown in Figure |T] a). 

*We drop the explicit lUabc dependence on the right side of Eq. 0. All quantities, including the complex constants, 
depend on niabc- 
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Figure 1. a) Invariant mass distribution of the abc system for one simulated data set overlaid with the 
contributions from the individual resonant amplitudes, b) - f) Dalitz plot distributions in a few bins of mate- 


In all cases the fit minimizes the quantity —2 log .if, where .if is the extended likelihood func¬ 
tion obtained from Figure ^ shows the results of a mass-independent fit where the p.d.f. 

used in the fit contains only the amplitudes from the true model {i.e., those in Table [I]). The inten¬ 
sities of the individual amplitudes are accurately and precisely determined, and the clear resonance 
structure from the Breit-Wigner propagators is apparent in the extracted phase differences between 
resonant amplitudes in the coherent sum. 

In general, however, the true set of amplitudes to use is not known a priori, and the full p.d.f. 
used in the fit could contain an arbitrarily large number of amplitudes. In this example, the p.d.f. 
is expanded to contain an additional 35 resonant amplitudes with additional allowed combinations 
of J,M,j and I listed in Table Figure 0 shows the results of an extended maximum likelihood 
fit to the same data set but using the expanded set of amplitudes. The performance of the fits has 
clearly deteriorated. The variance of the estimators of the true model parameters has in many places 
greatly increased. 

Additionally, the fits incorrectly determine that some of the 35 extraneous amplitudes (those 
not in the true p.d.f.) make significant contributions. The contribution from each resonant ampli¬ 
tude k in the tit can be represented by the tit fraction, defined as / r/x/ f \^{x)\^dx. 

The cumulative fraction of extraneous amplitudes is then defined as the fraction of the 35 extrane¬ 
ous amplitudes with a fit fraction larger than a given threshold, which is shown in Fig. ^ vs mate- 
In bins of niatc with low statistics, there are clearly sizable contributions from multiple extraneous 
amplitudes. 
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Figure 2. Results of extended maximum likelihood fit with the true p.d.f. from Table 1: a) - f) intensities for 
each resonant amplitude in the true p.d.f. labeled J^[Ml j] and g) - i) phase differences between each resonant 
wave and the corresponding reference wave with the same M. Closed symbols represent the measured 
intensities and phase differences, while the true model values are indicated by the solid curves. 
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Table 2. The set of extraneous resonant amplitudes and properties used in the full p.d.f. for the Dalitz-plot 
example. The symbols are: and M the spin-parity and spin projection of X; j is the isobar spin; and I is 

the orbital angular momentum between the isobar and the bachelor. 
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Figure 3. Results of extended maximum likelihood fit with the full p.d.f., including the additional 35 extra¬ 
neous amplitudes, and no LASSO regularization (i.e. A = 0): a) - f) intensities for each resonant amplitude 
in the true p.d.f. labeled J^[Ml j] and g) - i) phase differences between each resonant wave and the corre¬ 
sponding reference wave with the same M. Closed symbols represent the measured intensities and phase 
differences for the full p.d.f, open symbols represent the intensities determined from the true p.d.f. ht in 
Fig. 2, and the curves indicate the true model values for the phase difference. 


3. Method 

Identifying a "good" model consists not only of adequately describing the data, e.g., using one 
of the multivariate goodness-of-fit methods described in detail in Ref. [@1, but also in properly 
selecting the subset of amplitudes to include. A common approach taken to address this problem 
involves stepwise addition or subtraction of amplitudes from the model (for a detailed description 
of such methods, see Ref. [0); however, this approach is known to often produce an over-simplified 
modelOH]. Furthermore, the stepwise approach requires vast CPU resources for large data sets. 

An alternative approach is to regulate the regression procedure. A popular choice (outside 
of particle physics) is the Least Absolute Shrinkage and Selection Operator (LASSO)|I7|]. The 
LASSO imposes a so-called Li-type regularization condition on the sum of the absolute values of 
the regression parameters, which tends to produce sparse models; i.e., the LASSO tends to produce 
models with few significantly non-zero regression parameters. The use of the sum of the squared 
values of the regression parameters is referred to as L 2 -type regularization and is used, e.g., in so- 
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Figure 4. Cumulative fraction of extraneous amplitudes with fit fraction greater than y-axis value vi iriabc- 
The (red) line shows 1 /\/N, where N is the total number of events in the niabc bin. This is shown just to give 
a sense of statistical precision in the bin. 


called Ridge regression. Since Ridge regression uses the squared parameter values, applying it to 
an amplitude anlaysis would remove excessive interference; however, it would not drive unphysical 
model parameters to zero, and so is not as well suited to the task studied in this paper. 

Applying the LASSO to this analysis involves modifying the function that is minimized in the 
fit as follows: 


—21og.if + A 


I J Wnrl^dx , 


(3.1) 


where A is the LASSO regularization parameter. Our choice of LASSO penalty term does not 
depend on the relative normalization of the (x) terms and has the advantage that it requires 
almost no modification to existing fitting software^. 

A proper choice of A will reduce the number of amplitudes contributing to the fit result, similar 
to what is done explicitly with ad hoc procedures when not using regularization. However, the sub¬ 
set of amplitudes selected by the LASSO is that which minimizes Eq. which does not require 
independently running fits with all combinations of possible sets of amplitudes. Roughly speaking, 
the LASSO can be said to find fhe optimal model to describe the data with the parameter A setting 
the desired maximum amount of model complexity, but not explicitly setting a fixed number of co- 
efficienfs fo zero. Nofe thaf when A —0 fhe LASSO solufion reduces fo fhe nominal un-penalized 
one. Conversely, when A —oo the LASSO produces a model where few (if any) coefficienfs are 
nonzero. How fo choose the value for A is discussed below. 

Figure ^ shows the results of fitting an ensemble of 100 data sets in a single niabc bin {niabc = 
1.25) using values of A from 0 to 100. As A is increased from zero the contributions due to 
extraneous amplitudes decrease and the precision on the parameters of the amplitudes in the true 
model increases. As A approaches 100, however, even the fit fractions of the amplitudes that are in 


^Note that the penalty term is calculated using only information already required to calculate the nominal function 
minimized in the fit. 
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the true model are driven towards zero. Knowing the true model, it is clear to see that the optimal 
value of A is 1) in this example. 

The final puzzle piece then is to determine how much complexity in the model is optimal. 
This involves balancing improvements in the nominal minimization quantity (—2 log .if) with ad¬ 
ditional complexity in the model. There are many methods in the literature for choosing X using the 
data sample itself. Two considered in this paper are the Akaike and Bayesian information criteria 
(AIC [H and BIC [i]): 

AIC(A) =-2 log .if+ 2r, BIC(A) = -21og.if + rlogn, (3.2) 

where n is the number of events in the data set and r is related to the number of amplitudes in the 
model that have a fit fraction sizably different from zero {r is discussed in detail in the examples 
below). Therefore, both AIC and BIC take into account both improvements in the goodness-of-fit 
of the data and complexity of the model. In our studies below both AIC and BIC perform well. 

Combining the LASSO method with a chosen information criterion permits performing the 
amplitude selection procedure quickly using the data sample itself. The full procedure to be applied 
to a given data set is as follows: 

• choose the set of all possible amplitudes to be considered for inclusion in the model; 

• for any X , minimize Eq. o to obtain —2 log .if and the regression parameters; 

• calculate AIC(A) or BIC(A) with determination of r discussed below; 

• scan X values to find fhe one fhaf minimizes AIC or BIC. 

The value of X fhaf minimizes fhe chosen informafion criterion, X, is selected. Since penalty terms 
are added to the minimization quantity, a scan of the profile likelihood may no longer produce 
proper confidence infervals for fhe regression paramefers. In fhis paper fhe variances in regression 
parameters are defermined using fhe boofslrap [HIHl . Boofsfrap dafa sefs are resampled from fhe 
original, and fhe enfire procedure - including opfimizing A - is repealed on each copy dafa sef 
independenlly. An alternative approach is fo use fhe LASSO lo choose fhe model, and fhen rerun 
fhe til using fhe selected waves buf no penally lerm. 

In fhis study, we have chosen r to be the number of resonant amplitudes with a fit fraction 
larger than 10^^. Since each resonant amplitude has both an unknown magnitude and a phase, 
or equivalently real and imaginary factors, we also considered taking r to be twice the number of 
resonant amplitudes with a fit fraction greater than 10^^ but we see almost no change in the choice 
of optimal A. Furthermore, we tried increasing and decreasing the 10^^ threshold by a factor of 10 
but also see very little change in the choice of optimal A. 

Figure ^ shows the average AIC and BIC from the ensemble of datasets used to produce Fig. ^ 
as a function of A. The optimal AIC and BIC both occur near A = 1. By construction, the BIC 
method chooses a slightly larger A value and, thus, on average a less complex model. Figure 0 
shows the optimal value of A chosen for 100 independent data samples. In a real analysis, we 
suggest to try both AIC and BIC and use the difference in results obtained as part of the estimate 
for the systematic uncertainty due to model selection. 



Figure 5. Results of fitting an ensemble of 100 data sets in a single mass bin with niabc = 1-25. a) Fit 
fractions for each of the resonant amplitudes in the full p.d.f. as a function of X, where the width of each 
band represents the standard deviation obtained from the 100 data samples in the ensemble, b) is the same as 
a), but with a zoomed y-axis to better display the growing contributions of the extraneous resonant amplitudes 
as A —^ 0. c) Percentage of extraneous resonant amplitudes with fit fraction > X for A = 0, 0.1, 0.4, 0.8, 2, 10 
(increasing from light grey to black); this demonstrates the reduction in extraneous amplitudes contributing 
to the fit as A is increased. 



Figure 6. Average a) AIC and b) BIC as a function of A for the ensemble of 100 independent samples with 
^abc — 1.25. 


Figure ^ shows the results of the mass-independent fit where the BIC-LASSO procedure is 
used to select the optimal A in each mate bin separately. The amplitudes from the true p.d.f. are 
more accurately determined in this case, and as seen in Fig. 0 the number of extraneous amplitudes 
with non-zero fit fractions is dramatically reduced. For this Dalitz-plot analysis, using the BIC- 
LASSO produces results similar to those of using only the true model p.d.f. which, of course, is 
not known in the analysis of real data. This is an impressive result given that we allowed the 
initial set of amplitudes considered to be large. An important point to make here is that this model 
simplification was achieved without direct human intervention during the process. 
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Figure 7. Distribution of X values which minimize a) AIC and b) BIC for the ensemble of 100 independent 
samples with mate = 1-25. 



Generated 

True p.d.f. 

No LASSO 

BIC-LASSO 

1 + [0S1] 

41.74 

41.78 ±0.76 

40.37 ± 1.73 

41.83 ±0.86 

2-[052] 

10.55 

10.12 ±0.52 

10.58 ± 0.96 

10.11 ±0.71 

2-[0Pl] 

49.21 

48.67 ± 0.91 

53.40 ± 5.87 

48.04 ± 2.62 

2-[lDl] 

86.58 

86.64 ± 0.40 

86.47 ± 1.01 

86.5 ± 0.77 

1 + [151] 

1.77 

1.90 ± 0.23 

1.62 ±0.42 

1.60 ±0.40 


Table 3. Mean and standard deviation of the measured fit fractions for Model I. 


To evaluate the accuracy of the measured intensities from the fit, we study the measured fit 
fractions in the niabc bin with the largest intensity in the true p.d.f. for an ensemble of 1000 datasets 
without using the LASSO procedure (i.e. A = 0) and with the LASSO procedure using BIC to 
select the best X for each data sample and ruabc mass bin, respectively. Table 0 shows that the 
measured fit fractions are more precisely determined and less biased when using the BIC-LASSO 
procedure. 

In reality the true model is likely to contain many non-zero but small resonant contributions 
that the analyst is not interested in. To study this case, an alternate model with six additional 
resonant amplitudes with fit fractions all below 1% (see Table ^ is used to study the effectiveness 
of the BIC-LASSO procedure when there are a number of low intensity amplitudes in the true 
p.d.f. Throughout the remainder of the paper this alternate model is referred to as Model II. The 
same measured fit fraction distributions as in Table 0 are shown here in Table 0 for Model II. The 
amplitudes in the true p.d.f. remain unbiased when using BIC-LASSO even in the presence of many 
additional low intensity amplitudes. 

4. Identification of resonances 

In this section we present an outline for how to develop a procedure for identifying resonant struc¬ 
tures in the results of the mass-independent fit from the previous section. The goal of such a 
procedure is to search for new X resonances or measure the properties of known resonances in the 
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Figure 8. Results of extended maximum likelihood fit with the full p.d.f., including the additional 35 extra¬ 
neous amplitudes, and using BIC to select the optimal A for each iriahc bin; a) - f) intensities for each resonant 
amplitudes in the true p.d.f. labeled J^[Ml j] and g) - i) phase differences between each resonant wave and 
the corresponding reference wave with the same M. Closed symbols represent the measured intensities and 
phase differences for the full p.d.f. using the LASSO, open symbols represent the intensities determined 
from the true p.d.f. ht in Fig. 2, and the curves indicate the true model values for the phase difference. The 
agreement between the ht results using the true model p.d.f. vi the full model using BIC-LASSO is good 
enough that is difficult to see both results in these plots since they overlap. 
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Table 4. The set of resonant terms and properties (in addition to those in Table 1) used in the true p.d.f. for 
Model II. The symbols are: J^, nix, Txj M the spin-parity, mass, width and spin projection of X; j is the 
isobar spin; and I is the orbital angular momentum between the isobar and the bachelor. 
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Figure 9. Cumulative fraction of extraneous amplitudes with fit fraction greater than y-axis value vi niabc 
using BIC to select the optimal A for each iriabc bin. The (red) line shows 1 / ^/N, where N is the total number 
of events in the niabc bin. This is shown just to give a sense of statistical precision in the bin. 


jP[Mlj] 

Generafed 

True p.d.f. 

No LASSO 

BIC-LASSO 

1 + [0S1] 

41.76 

41.11 ±0.76 

39.79 ± 1.19 

40.75 ± 0.79 

2-[052] 

9.99 

9.99 ± 0.47 

10.89 ± 1.01 

10.65 ± 0.83 

2'[0P1] 

48.80 

48.50 ± 1.00 

53.99 ±7.41 

47.37 ± 4.54 

2-[lDl] 

86.46 

86.41 ± 0.40 

87.22 ± 0.78 

87.13 ±0.76 

1 + [151] 

1.98 

1.95 ± 0.23 

1.80 ±0.37 

1.75 ±0.36 


Table 5. Mean and standard deviation of the measured ht fractions for Model II. 


niabc mass spectrum. A resonance is typically observed through both a significant intensity and 
corresponding phase motion in a particular amplitude. For example, Fig. m shows a significant 
intensity for the 2^ [052] state and resonant-like phase motion relative to the 1+[051] amplitude 
(these results are obtained from the BIC-LASSO regulated mass-independent fits described in the 
previous section). The purpose of this section is to show that if such a procedure can be developed 
prior to examining the data — recall that employing the BIC-LASSO permits defining fhe model 
in each Mate bin using fhe dafa ifself — if would be possible fo define fhe sfafisfical significance of 
each resonance. 

There are a number of difficulties that must be addressed by any procedure meant to identify 
resonance contributions: 

1. The functional form of the niabc dependence of the intensity and phase for a resonant ampli¬ 
tude is likely not known a priori. Resonances are often well described by the terms included 
in our model amplitudes; however, for certain cases, e.g. when multiple resonances with the 
same quantum numbers overlap in niabc, this prescription is not valid. 

2. Even in the best case scenario, there will be unknown parameters in the functional form 
discussed in (1) that invalidate the use of Wilks’ Theorem HTTl] . Obtaining the significance 
will require generafing a large ensemble of pseudo dafa sefs. 
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Figure 10. Resonant amplitude intensity for a) 1 + [051] and b) 2 [052] and c) phase difference between the 
two amplitudes. The red curves are a simultaneous fit to the three distributions described in the text. 


3. There may be multiple resonances with the same quantum numbers contributing to different 
regions of niabc', therefore, one either needs to scan in niabc and consider restricted regions, 
or perform a fit which allows for an arbitrary number of resonance contributions. 

4. Since an analysis will typically look for resonance contributions in many different ampli¬ 
tudes, each with an unknown mass, the statistical significance must account for the so-called 
trials factor flU. 

Developing a general procedure to identify resonances will be difficult, but it is certainly worthy of 
investigation. 

In this section, we provide an ad hoc procedure that works well for this specific toy analysis. 
We sfress fhaf our goal here is fo demonsfrafe fhaf if such a procedure is known a priori, if can 
be used in combinafion wifh fhe BIC-LASSO procedure to defermine fhe sfafisfical significance of 
each resonance. To idenfify a resonance in, e.g., fhe 2^[052] amplifude, fhere are fhree relevanf 
disfribufions fo consider: 1) fhe infensify of fhe 1+[051] reference amplifude, 2) fhe infensify of fhe 
2^ [052] amplifude of inferesf, and 3) fhe phase difference A(p befween fhe fwo amplifudes. The 
functional form discussed in poinf (1) above is faken direcfly from fhe frue model p.d.f., buf wifh 
fhe resonance parameters freafed as unknowns. 

The fhree disfribufions in Fig. are fif simulfaneously using minimization^, where fhe 
reference amplifude is assumed fo be a known sfafe and, fherefore, ifs mass and widfh are fixed in 
fhe fif. Poinf (3) is addressed by scanning in niabc for each amplifude, and resfricfing fhe fif range 
fo a maximum of ±0.5 and minimum of 6 niabc bins. This approach works well for fhe resonances 
included in our model; in a real analysis, one would need fo defermine fhese limifs by ofher means. 
The acfual range used for each resonance fif is derived by requiring consecufive bins wifh infensify 
more fhan 2a from zero. Tables^ and 0 report fhe measured masses and widfhs of fhe Breif-Wigner 
fifs for fhe non-reference amplifudes in fhe frue p.d.f. The LASSO greafly improves fhe resolution 
on fhe resonance paramefers. 

^We use the notation of here since the minimization quantity is constructed as the sum of the square of the 
(observed-expected)/uncertainty in each bin. This quantity, however, is not expected to follow a x~ distribution, and 
Wilks’ Theorem is not expected to be valid as discussed in point (2) above. 
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Model I 

Model II 


Generated 

No LASSO 

BIC-LASSO 

No LASSO 

BIC-LASSO 

2-[0S2] 

1.45 

1.46 ± 0.08 

1.45 ±0.01 

1.47 ±0.04 

1.45 ±0.01 

2-[0Fl] 

1.45 

1.44 ± 0.07 

1.43 ±0.01 

1.43 ±0.09 

1.45 ±0.04 

1+[151] 

1.25 

1.26 ± 0.03 

1.25 ±0.01 

1.26 ±0.04 

1.25 ±0.01 


Table 6. Mean and standard deviation of the measured mx distributions extracted from the Breit-Wigner fits. 




Model I 

Model II 


Generated 

No LASSO 

BIC-LASSO 

No LASSO 

BIC-LASSO 

2-[052] 

0.25 

0.25 ± 0.03 

0.25 ± 0.01 

0.25 ± 0.02 

0.25 ± 0.01 

2-[0Fl] 

0.25 

0.27 ± 0.03 

0.24 ± 0.01 

0.27 ± 0.03 

0.26 ± 0.02 

1 + [151] 

0.25 

0.28 ± 0.03 

0.26 ± 0.02 

0.27 ± 0.03 

0.25 ± 0.02 


Table 7. Mean and standard deviation of the measured distributions extracted from the Breit-Wigner fits. 


We now address the topic of whether an observed resonant contribution is statistically signifi¬ 
cant. For each resonant amplitude fit, a Ax^ is calculated between the x'^ of the fit described above 
and the x^ for a model with no resonant signal in the amplitude of interest. If an amplitude did not 
have 6 consecutive bins with intensity > 2a or the total phase motion, i.e., the difference between 
the minimum and maximum phase difference values in the fit range defined above, was less fhan 
7r/4 (anofher ad hoc requiremenf) fhe fil is deemed invalid and assigned Ax^ = 0. To evaluate Ibis 
procedure an ensemble of 10^ dafasefs was generated. The Ax}' disfribufion from fhis ensemble 
for fhe resonanf amplifudes in fhe frue p.d.f., 2^[05'2], 2^[0/’l], and 1+[15'1] are shown in Fig. [iT] 
a) wifhouf using fhe LASSO procedure (i.e. A = 0) and b) wifh fhe LASSO procedure using BIC 
fo selecf fhe besf A for each dafa sample and niahc mass bin. The mean of fhe Ax^ disfribufion is 
significanfly larger for all fhe resonanf amplifudes in fhe frue p.d.f. when fhe BIC-LASSO pro¬ 
cedure is used. This means fhaf a resonance will be stafisfically more significanf when using fhe 
BIC-LASSO. 

Fig. shows fhe disfribufion of fhe fraction of datasefs wifh Ax} above a given value Ax}- 
for exfraneous amplifudes in fhe fif, a) wifhouf fhe LASSO procedure and b) wifh fhe BIC-LASSO 
procedure. Only ~ 1% of fhe dafasefs fif wifh fhe BIC-LASSO procedure had any exfraneous 
amplifudes fhat safisfied fhe criferia for being assigned a non-zero Ax}. There are no dafasefs 
fif using fhe BIC-LASSO fhaf have a maximum Ax} for any exfraneous wave greater fhan 100. 
Wifhouf using fhe LASSO, almosf all dafa sefs have af leasf one exfraneous amplifude assigned a 
non-zero Ax} value. Furthermore, almost 1% have a maximum extraneous-amplitude Ax}^ > 100. 
In this example, 4a significance would require at most Ax} > 250 for the non-LASSO fits and 
> 70 for the BIC-LASSO fits. The actual Ax} requirements depend on the trials factor. 

In the previous section we showed that the model itself can be simplified without direct human 
intervention during the process which means that it is possible to get to the mass-dependent fit 
stage blindly. The type of procedure presented in this section for assigning significance could also 
be applied blindly, provided the procedure is fixed prior to examining the data. This would permit 
obtaining a p-value using pseudo data sets, since the entire procedure is automatic. Further research 
on developing a general version of such a procedure is strongly encouraged. 
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Figure 11. distribution for resonance amplitudes in the true p.d.f.: 2^ [052] (blue), 2^ [OPl] (cyan), and 
1+[151] (green); a) without using the LASSO procedure (i.e. A = 0) and b) with the LASSO procedure 
using BIC to select the best X for each data sample and niabc mass bin. 




Figure 12. Cumulative maximum Aj^ distribution from the extraneous amplitudes: a) without using the 
LASSO procedure (i.e. A = 0) and b) with the LASSO procedure using BIC to select the best A for each 
data sample and niabc mass bin. 


5. Summary 

This paper demonstrates how to effectively apply the LASSO regularization procedure to an am¬ 
plitude analysis. This procedure produces similar results in fits using only the true model p.d.f. and 
those that include a large number of extraneous amplitudes. The use of regularization removes the 
need for direct human intervention to simplify the model, making it possible to develop procedures 
for determining the significance of a resonance observation. An outline of how to assign such a 
significance was also presented. All of the techniques presented in this paper require only minor 
modifications to existing amplitude-analysis software. 
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